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Abstract. We define a de Bruijn process with parameters n and Las a 
certain continuous-time Markov chain on the de Bruijn graph with words 
of length L over an n-letter alphabet as vertices. We determine explicitly 
its steady state distribution and its characteristic polynomial, which 
turns out to decompose into linear factors. In addition, we examine the 
stationary state of two specializations in detail. In the first one, the de 
Bruijn-Bernoulli process, this is a product measure. In the second one, 
the Skin-deep de Bruin process, the distribution has constant density 
but nontrivial correlation functions. The two point correlation function 
is determined using generating function techniques. 



1. Introduction 

A de Bruijn sequence (or cycle) over an alphabet of n letters and of order 
L is a cyclic word of length n such that every possible word of length L over 
the alphabet appears once and exactly once. The existence of such sequences 
and their counting was first given by Camille Flye Sainte-Marie in 1894 for 
the case n = 2, see |FSM94) and the acknowledgement by de Bruijn[dB75], 
although the earliest known example comes from the Sanskrit prosodist 
Pingala's Chandah Shaastra (some time between the second century BCE 
and the fourth century CE [VN93, Knu06]). This example is for n = 2 and 
L = 3 essentially contains the word 0111010001 as a mnemonic for a rule in 
Sanskrit grammar. Omitting the last two letters (since they are repeating 
the first two) gives a de Bruijn cycle. Methods for constructing de Bruijn 
cycles are discussed by Knuth |Knu05| . 

The number of de Bruijn cycles for alphabet size n = 2 was (re-)proven 
to be 2 2 1 ~ L de Bruijn |dB46| . hence the name. The generalization to 
arbitrary alphabet size n was first proven to be n\ n ■ n~ L by de Bruijn 
and van Aardenne-Ehrenfest. This result can be seen as an application of the 
famous BEST-theorem [vAEdB51j ITS41} [Tut84] . which relates the counting 
of Eulerian tours in digraphs to the evaluation of a Kirchhoff (spanning-tree 
counting) determinant. The relevant determinant evaluation for the case of 
de Bruijn graphs (see below) is due to Dawson and Good |DG57j . see also 
|Knu67| . 



Date: August 30, 2011. 



1 



2 



ARVIND AYYER AND VOLKER STREHL 



The (directed) de Bruijn graph G n,L is defined over an alphabet E of 
cardinality n. Its vertices are the words of u = U\U2---U G a L , and 
there is an directed edge or arc between any two nodes u = u\U2 ■ ■ - ul and 
v = V1V2 ■ ■ ■ vl if an only if t(u) = u-i . . . u n = v± . . . u n -i = h(v), where h(v) 
(t(u) resp.) stands for the head of v (tail of u, resp.). This arc is naturally 
labeled by the word w = u.vl = u\.v, so that h(w) = u and t(v) = v. It is 
intuitively clear that Eulerian tours in the de Bruijn graph G n,L correspond 
to de Bruijn cycles for words over E of length L + 1 de Bruijn graphs and 
cycles have applications in several fields, e.g. in networking [KK03] and 
bioinformatics |PTW0l] , For an introduction to de Bruijn graphs, see e.g. 
[Ral82j . 

In this article we will study a natural continuous-time Markov chain on 
G n > which exhibits a very rich algebraic structure. The transition proba- 
bilities are not uniform since they depend on the structure of the vertices as 
words, and they are symbolic in the sense that variables are attached to the 
edges as weights. We have not found this in the literature, although there 
are studies of the uniform random walk on the de Bruijn graph [FKT88J. 
The hitting times |Chc03 and covering times [Mor92j of this random walk 
have been studied, as has the structure of the covariance matrix for the al- 
phabet of size n = 2 [AM04J and in general [Alh04j . The spectrum for the 
undirected de Bruijn graph has been found by Strok |Str92j . We have also 
found a similar Markov chain whose spectrum is completely determined in 
the context of cryptography [GG96j . 

After describing our model on G n,L for a de Bruijn process in detail in the 
next section, we will determine its stationary distribution in Section 3 and 
its spectrum in Section 4. In the last section we discuss two special cases, 
the de Bruijn-Bernoulli process and the Skin-deep de Bruijn process. 



2. The Model 

We take the de Bruijn graph G n,L as defined above. As alphabet we may 
take E = E n = {1,2, ... ,n}. Matrices will then be indexed by words over 
E n taken in lexicographical order. Since the alphabet size n will be fixed 
throughout the article, we will occasionally drop n as super- or subscript if 
there is no danger of ambiguity. 

From each vertex u = U1U2 ■ . . ul £ E L there are n directed edges in G n,L 
joining u with the vertices «2^3 • • • u n .a = t(u).a for a £ E. 

We now give weights to the edges of the graph G n,L . Let X = {x a ^ ; a E 
E, k > 1} be the set of weights, to be thought of as formal variables. An 
a-block is a word u G E + which is the repetition of the single letter a so that 
u = a k for some a G E and k > 1. Obviously, every word u has a unique 
decomposition into blocks of maximal length, 



(2.1) 
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where each factor feW is a block so that any two neighboring factors are 
blocks of distinct letters. This is the canonical block factorization of u with 
a minimum number of block-factors. 

We now define the function (3 : S + — > X as follows: 
- for a block a k we set /3(a fc ) 



(2.2) 



- for u E S + with canonical block factorization (2.1) we set ft(u) = 

i.e., the /3-value of the last block of u. 
An edge from vertex u G S L to vertex v G so that /i(t>) = i(u) with 
v = t(u).a, say, will then be given the weight /3(f). This means that 

x a ,L if P{u) = x a<L , 
x a ,k+i if P{u) = x a)k with k < L, 
x a> i if /3(u) = x^jfe for some b ^ a. 

Our rfe Bruijn process will be a continuous time Markov chain derived 
from the Markov chain represented by the directed de Bruijn graph G n,L 
with edge weights as defined above. The transition rates are f3{v) for tran- 
sitions represented by edges ending in v. We note that these rates can be 
taken just as variables and not necessarily probabilities. Similarly, expec- 
tation values of random variables in this process will be functions in these 
variables. 

The simplest nontrivial example occurs when n = L = 2. There are four 
configurations and the relevant edges are given in the Figure [l} 




Figure 1. An example of a de Bruijn graph in two letters 
and words of length 2. 

Before stating our notation for the transition matrix of a continuous-time 
Markov chain, our de Bruijn process, we need a general notion. 

Definition 1. For any kxk matrix M , let V M denote the matrix where the 
sum of each column is subtracted from the corresponding diagonal element, 

(2.3) v M = M-diag(l fe -M), 
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where lk denotes the all-one row vector of length n and diag(mi, . . . , rut) is 
a diagonal matrix with entries mi, . . . , m^ on the diagonal. 

In graph theoretic terms V M is the (negative of) the KirchhojJ matrix or 
Laplacian matrix of G, if M is the weighted adjacency matrix of a directed 
graph G. In case M is a matrix representing transitions of a Markov chain, 
the column (or right) eigenvector of V M for eigenvalue zero properly nor- 
malized gives the stationary probability distribution of the continuous-time 
Markov chain. 

We note that the graphs G n,L are both irreducible and recurrent, so that 
the stationary distribution is unique (up to normalization). We will use 
M n,L to denote the transition matrix of our Markov chain, 



(2.4) 



n,L 



v.u 



rate(u 



^M n ' L is then precisely the transition matrix, 



(2.5) 



n,L 



/3(v) 



Piy). 

for u ^ v , 
for u = v. 



For the example in Figure [TJ with lexicographic ordering of the states, 



(2.6) 



M 



2,2 



V 



-£2,1 
X2,l 










Sl,2 
-Si i2 - X 2 ,l 








xi,i 

-xi,i J 



-Xl } i - X 2 ,2 

^1,1 

^2,2 

The stationary distribution is given by probabilities of words, which are to 
be taken as rational functions in the variables x a t- It is the column vector 
with eigenvalue zero, which after normalization is then given by 

(2.7) 

Pr[l, 1] = ? ■ Xl f'> 2 ■ -, Pr[l,2] 



^2,1^1,1 



Pr[2, 1] 



(Xl,2+X 2 ,l) {Xl,l +X 2 ,l] 
^2,1^1,1 



Pr[2,2] 



(xi t i + 2:2,2) (xi,i +x 2 ,i)'' 
x 2 , 2 x 2tl 



(xi >2 + x 2j i) (xi,i + x 2t i) ' ' (x Xt i + x 2j2 ) {xi,i + x 2t i) ' 

Notice that the probabilities consist of a product of two monomials in the 
numerator and two factors in the denominator, and that each factor con- 
tains two terms. Also, notice that not all the denominators are the same, 
otherwise the steady state would be a true product measure. Of course, the 
sums of these probabilities is 1, which is not completely obvious. 

It is also interesting to note that the eigenvalues of V M 2 ' 2 are linear in 
the variables. Other than zero, the eigenvalues are given by 

(2.8) - xi t i — x 2j2 , —^1,1 — ^2,1, and - 21,2 - ^2,1- 

Another way of saying this is that the characteristic polynomial of the tran- 
sition matrix factorizes into linear parts. 
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3. Stationary Distribution 

In this section we determine an explicit expression for the steady state 
distribution of the de Bruijn process on G n,L . Before we do that we will 
have to set down some notation. For the moment we are working over S + , 
the set of all nonempty words over the alphabet E (of size n). 

For convenience, we introduce operators which denote the transitions of 
our Markov chain. Let d a be the operator that adds the letter a to the end 
of a word and removes the first letter, 

(3.1) d a : u H> t(u).a. 

With (3 as introduced we introduce the shorthand notation 

(3.2) Pa,m = Y,^ am ) = X a,m+ £ X b , X . 

Note that /3 Q) i = X^es x b,i does not depend on a. We now define the 
valuation //(«) for u G S + as 

(3.3) fi{u) - 



Note that the restriction of \i on the alphabet S is (formally) a probability 
distribution. Finally, we define the valuation p,, also on E + , as 

L 

(3.4) fi(u) = Y\_K u i u 2 ■■■u i ) = ^.(■ui)/i(niu 2 ) • ■■n{uiU2 . ..u L ), 

i=l 

if u = u\U2 ■ ■ - UL- The following result is the key to understanding the 
stationary distribution. 

Proposition 1. For all u G E + , 

(3.5) ^2fi(a.u) = p,(u). 



Proof. As in (2.1), let us write w in block factorized form: 

(3.6) u = b^b^ ---b^ = w.b {m \ 

where u = b^ . . . fc^ -1 ) if m > 1, and u is the empty word if m = 1. 
If &M = a fc , then 



(3.7) /i(u) 



( a^fc ■£ m _ x ; i.e., if u is a block, 
a ' k if m > 1, 

> Pa,fc+1 
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and thus 



( k 



(3.8) p,(u) = < 



n 



if m = 1, i.e., if u is a block, 



' TT ~ if m > 1. 



We will define another valuation on S + closely related to fi, which we call 



p. Referring to the factorization (3.6) we put 

( k 



(3.9) p(u) = { 



TT a ' 3 if m = 1, i.e., if u = a k is a block, 

f = l PaJ+l 



J]p(« (i) ) ifm>l. 



This new valuation is related to p by the following properties: 
- For blocks u = a k we have 



(3.10) 



P(a k ) = #^(a fc ), 



Pa. 



fe+1 



For u with factorization (3.6) we have 



(3.H) -p{u) = m-p(p [m) ), 

- which, by the obvious induction, implies 



(3.12) 



1=2 



We are now in a position to prove identity (3.5). First consider the case 
where u = a k is a block. 



1 /lid 1, ) ■ 5>(6)-p(a*) 



(3.13) 



X a ,k+l -,k\ 



Xb ' 1 -»(a k ) 



x a,k+l 



+E 



356,1 



Ai,fc+1 P 



/2(a fc ) 
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where we used (3.10) in the last-but-one step. The general case is then 
proven by a simple induction on m. 

£ ji(a.bV>bW . . . b™) = Y, A(o-6 (1) 6 (2) • • • b {m - 1] ) ■ P(b {m) ) 

( 3 - 14 ) = f i^ 1 ¥ 2 K..b^)-p(b^) 

= /2(&«&( 2 )...&M), 
where we have used property ( 3.11[ ) of p in the last step. □ 

As a consequence of Proposition [TJ we have the following result, which 
is an easy exercise in induction. The case L = 1 was already mentioned 
immediately after (3.3). 

Corollary 2. For any fixed length L of words over the alphabet X, 
(3.15) Y, = L 

Therefore, the column vector fi n ' L = [yw(w)] ngS z, can be a seen as a formal 
probability distribution on £ . We now look at the transition matrix M n,L 
more closely. 

(3-16) = S h(v)=t{u) (3(v). 

where S x is the indicator function for x, i.e., it is 1 if the statement x is 
true and otherwise. Thus the matrix M n,L is very sparse. It has just 
n off-diagonal non-zero entries per row and per column. More precisely, 
the row indexed by v has the entry /3(v) for the n 9-preimages of v, and 
the column indexed by w contains (3{d a v) as the only nonzero entries. In 
particular, the column sum for the column indexed by u is Xlaes P(d a (u)). 
Define the diagonal matrix A n,L as one with with precisely these column 
sums as entries, i.e. 

( 317 ) A n,L _ /Eaes/ 3 ^) v = u > 



v,u 



otherwise. 



Theorem 3. The vector fi n,L is the stationary vector for the de Bruijn 
process on G n,L , i.e., 

(3.18) M n ' L [i n ' L = A n ' L fi n ' L . 

Proof. Consider the row corresponding to word v = v±V2 ■ ■ ■ vl^\Vl = h(v).vi 
in the equation 

(3.19) MJZ = AJI. 



On the l.h.s. of (3.19) we have to consider the summation X^«es L M v>u p-(u), 
where only those uu G T, L with t{u).VL = v contribute. This latter condition 
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can be written as u = b.h{v) for some b £ S, so that this summation can be 
written as 

M VjU Jl{u)b = Mv,b.h(v) V-{b-h{v)) 

(3.20) 

= f3(v)Y / JL(b.h(v)) = f3(v)]l(h(v)), 
fees 

where the last equality follows from Lemma 1. 



On the r.h.s. of (3.19) we have for the row entry corresponding to the 
word v. 



A„,„ n(v) = ^ (3(d a v) n(v) 

aeS 

P(d a v) ■ HHv)) IM(V) = P{V) Ti(h(v)) 



(3.21) " T " : 



in view of the inductive definition of \i in (3.4) and the definition of /j, in 



d3.3[ ). □ 

Let Z n ' L denote the common denominator of the stationary probabilities 
of configurations. This is often called, with some abuse of terminology, 
the partition function [BE071. The abuse comes from the fact that this 
terminology is strictly applicable in the sense of statistical mechanics while 
considering Markov chains only when they are reversible. The de Bruijn 
process definitely does not fall into this category. Since the probabilities are 



given by products of fi in (3.4), one arrives at the following product formula. 

Corollary 4. The partition function of the de Bruijn process on G n,L is 
given by 

L-l n 

(3.22) Z"< L = /V • \{ \{ (3 a , m . 

m=2 a=l 

Physicists are often interested in properties of the stationary distribution 
rather than the full distribution itself. One natural quantity of interest in 
this context is the so-called density distribution of a particular letter, say a, 
in the alphabet. In other words, they would like to know, for example, how 
likely it is that a is present at the first site rather than the last site. We 
can make this precise by defining occupation variables. Let rf> % denote the 
occupation variable of species a at site %: it is a random variable which is 1 
when site i is occupied by a and zero otherwise. We define the probability in 
the stationary distribution by the symbol ( • ). Then ( rf ,% ) gives the density 
of a at site i. Similarly, one can ask for joint distributions, such as ( rf^rf^ ), 
which is the probability that site i is occupied by a and simultaneously that 
site j is occupied by b. Such joint distributions are known as correlation 
functions. 

We will not be able to obtain detailed information about arbitrary corre- 
lation functions in full generality, but there is one case in which we can easily 
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give the answer. This is the correlation function for any letters a^, . . . , 0&2, 01 
at the last k sites. 



Corollary 5. Let u = . . . 0201. Then 
(3.23) ^k,L-k+l . . . ^L-l^a, 

Proof. By definition of the stationary state, 
(3.24) 



Using Proposition [T] repeatedly L — k times, we arrive at the desired result. 

□ 



In particular, Corollary [5] says that the density of species a at the last site 
is simply 

(3.25) {rf> L ) = X f±. 

Pa,l 

Formulas for densities at other locations are much more complicated. It 
would be interesting to find a uniform formula for the density of species a 
at site k. 

4. Characteristic Polynomial of v M n,L 

We will prove a formula for the characteristic polynomial of ^M n,L in 
the following. In particular, we will show that it factorizes completely into 
linear parts. In order to do so, we need to understand the structure of 
the transition matrices better. We denote by x(M; A) the characteristic 
polynomial of a matrix M in the variable A. 

To begin with, let us recall from the previous section that the transition 
matrices M n ' L , taken as mappings defined on row and column indices, are 
defined by 

(4.1) 



M n ' L : Sf x 



X : (v,u) ^ 5 h(v)=t(u) ■ /3(v). 
Lemma 6. The matrix M n,L can be written as 



(4.2) 



M 



n.L 



n,L 1 j^n,L 



A n ' L ] (n copies of A n ' L ) 



where A n ' L is a matrix of size n L x n L 1 given by 



(4.3) 

We have 
(4.4) 

A n,l 



j^n,L . yjn,L ^ yjn,L— 1 



XU{0} : (v,u)^6 h(v)=u -p(v). 



A 



n,L 



A 



n,L-l 



~\n,L— 1 



yn,L— 1 



yn,L— 1 



A 



n,L-l 



yi,L—l 
ya,L—l 



-\n.L-l 



A 



n,L—l 



B 



n,L-l 



B 



n,L-l 
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where A^ L 1 is like A n,L 1 , but with Xk,L-\ replaced by Xk t L, and where 
Qn,L-i j g zero ma trix f s j ze j^ -1 x n L ~ 2 . The matrices Ba' L ~ l are 
square matrices of size n i_1 x n L ~ l , where for each a£E the matrix B™ ,L 
is defined by 

(4.5) B^ L : S L x S L -> X U {0} : ( V , u) ^ <y o . h(t))=u • fta.w). 

With these matrices at hand we can finally define the matrix B n ' L = 2^ a eE Ba ,L 
of size n L x n L , so that 

(4.6) B n ' L : S L x S L -> X U {0} : («, u) m- <5 h(t))=t(tt) • £(«!.«). 
Lemma 7. M™' 1 — ,B n > L is a diagonal matrix. 

Proof. We have 

(4.7) M n ' L (v,u) / B n ' L (v,u) = t(u) and /3(«i.u) / 

But f3{ui.v) can only happen if the last block of u\.v is different 

from the last block of v, which only happens if v itself is a block, v = a L , 
and u\ = a, in which case (3(v) = x a> L and f}(u\.v) = x a L+i- So we have 



(4.8) {B n ' L -M n ' L )(v,u) 



n,L\ 



X a ,L+l ~ %a,L M V = U = a L , 







otherwise. 



□ 



We state as an equivalent assertion: 

Corollary 8. For the Kirchhoff matrices of M n,L and B n ' L we have equality: 

(4.9) v M n ' L = v 5 n ' L . 

We now prove a very general result about the characteristic polynomial 
of a matrix with a certain kind of block structure. This will be the key to 
finding the characteristic polynomial of our transition matrices. 



Lemma 9. Let Pi, ... , P m , Q be any kx k matrices, P = Pi + ■ — h P m and 
(4.10) R = 



Pl + Q P2 
Pi P2 + Q 



p 

1 n 

Prr 



Pi 



P2 



Q 



Then 

(4.ii) X (R; A) = X (Q; A)™' 1 • x(P + Q; A). 
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Proof. Multiply R by the block lower-triangular matrix of unit determinant 
shown to get 



(4.12) 



R 



1 
-1 1 
-1 1 







Q 

-Q Q 

-Q Q 





Pi 

P-2 

P m + Q 



which has the same determinant as R. Now perform the block row operations 
which replace row j by the sum of rows 1 through j to get 



(4.13) 



Q 

Q 

Q 





Pi 
P1 + P2 
P1+P1 + P 3 

P + Q 



Since this is now a block upper triangular matrix, the characteristic poly- 
nomials is the product of those of the diagonal blocks. □ 

We will now apply this lemma to the block matrix 



(4.14) 



D 



n,L 



jjn,L 



n,L 



B. 



J->n 



B n,L 
B n,L _ D n, L 



D 
D. 



n,L 
1 

n,L 



r>n,L 



B n,L _ D n,L 



where D n,L is the (n L x n L )-diagonal matrix with the column sums of A n,L+1 
on the main diagonal. 

Proposition 10. The characteristic polynomials xC^M n,L ; z) satisfy the 
recursion 

(4.15) X( V M"< L+1 ; z) = X (-D n > L ; z)^ 1 ■ X ( V M^; z). 

Proof. From Corollary [8j Lemma [9J and the easily checked fact ^B n,L = 
B n ' L - D n ' L we get: 

x( V M n,L+l. A) = x (- D n,L. A) n-1 . B n,L _ ^ A) 

= x(-D n > L ;\r- 1 -x(B n > L -D n > L ;\) 
= X (-D n ' L ;\) n - 1 -xC / B n > L ;\) 
= X (-D n > L ;\r- 1 -xC / M n > L ;\). 

□ 



(4.16) 



As a final step, we need a formula for x{—P >n ' L , A). 
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Lemma 11. The characteristic polynomial of —D n ' L is given by 
(4.17) 

'A + 01,1 ifL 



-D n ' L , A) 



1, 



1,-1 

n ri( A +^ 

, m=2 aSS 



\ (n— l)n £ 



Y[(\ + Pa,L) ifL>l. 



Proof. The case L = 1 follows directly from the definition of A n ' in (4.4). 
For general L, recall that A n,L contains n copies of A n ~ 1,L with one factor 
containing x a ,L-i removed and one factor containing x a ^ added instead, for 
each aSS. Thus, 



(4.18) 



-D n < L ,X)=[ x (-D n > L - 1 ,X)] n -l[ 



A + P a ,L 
A + Pa,L-l 



which proves the result. 



□ 



We can now put everything together and get from Proposition 10, Lemma 
11 and checking the initial case for L = 1: 



Theorem 12. The characteristic polynomial of the de Bruijn process on 
G n ' K is given by 



(4.19) 



X { w M n > L -\) 



A (A + /3i,i). J] i[(\ + Pa, m ) (n - 1)nl 

m=2 aeS 



5. Special cases 

We now consider special cases of the rates where something interesting 
happens in the de Bruijn process. 

5.1. The de Bruijn-Bernoulli Process. There turns out to be a special 
case of the rates x a j for which the stationary distribution is a Bernoulli 
measure. That is to say, the probability of finding species a at site i in 
stationarity is independent, not only of any other site, but also of i itself. 
This is not obvious because the dynamics at any given site is certainly a 
priori not independent from what happens at any other site. Since the 
measure is so simple, all correlation functions are trivial. We denote the 
single site measure in (3.3) for this specialized process to be fi y , and the 
stationary measure (3.4) as p, y . 

Corollary 13. Under the choice of rates x a j = y a independent of j , the 
stationary distribution of the Markov chain with transition matrix ^M n ' L is 
Bernoulli with density 

Va 



(5.1) 



Pa 
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Proof. The choice of rates simply mean that species a is added with a rate 



independent of the current configuration. From (3.3), it follows that for 
U = U\U2 . ■ ■ u L , 

(5-2) n v (u) = Vul = p UL , 



and using the definition of the stationary distribution p in (3.4), 

L 

(5-3) fty(u) = Y[pui, 

i=i 

which is exactly the definition of a Bernoulli distribution. □ 

5.2. The Skin-deep de Bruijn Process. Another tractable version of 
the de Bruijn process is one where the rate for transforming the word u = 
u\U2 ■ ■ ■ ui into d a u = t(u).a = u-i . . . u^.a for a £ £ only depends on the 
occupation of the last site, ul- Hence, the rates are only skin-deep. An 
additional simplification comes by choosing the rate to be x when a = ul 
and 1 otherwise. Namely, 



(5.4) 



x for j = 1, 
1 for j > 1. 



We first summarize the results. It turns out that any letter in the alphabet 
is equally likely to be at any site in the skin-deep de Bruijn process. This 
is an enormous simplification compared to the original process where we do 
not have a general formula for the density. Further, we have the property 
that all correlation functions are independent of the length of the words. 
This is not obvious because the Markov chain on words of length L is not 
reducible in any obvious way to the one on words of length L — 1. This 
property is quite rare and very few examples are known of such families of 
Markov chains. One such example is the asymmetric annihilation process 
[5SI0| . 

The intuition is as follows. By choosing x <C 1 one prefers to add the 
same letter as u^, and similarly, for x> 1, one prefers to add any letter in 
E other than ul. Of course, x = 1 corresponds to the uniform distribution. 
Therefore, one expects the average word to be qualitatively different in these 
two cases. In the former case, one expects the average word to be the same 
letter repeated L times, whereas in the latter case, one would expect no two 
neighboring letters to be the same on average. Our final result, a simple 
formula for the two-point correlation function, exemplifies the different in 
these two cases. 

We begin with a formula for the stationary distribution, which we will 
denote in this specialization by pZ x . We will always work with the alphabet 
£ on n letters. 
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Lemma 14. The stationary probability for a word u = u\U2 ■ ■ - Ul £ S L is 
given by 

( 5 - 5 ) ArO) = , 7 TYTt^I' 

n(l + (n — ljx)^ 1 

where ^{u) is the number of blocks of u. 

Proof. Analogous to the notation for the stationary distribution, we denote 
the block function by (3 X . From the definition of the model, 



(5.6) /3 x (a h 



if k = 1, 
if k > 1. 



and thus, for any word u the value f3 x {u) is x if the length of the last block 
in its block decomposition is 1, and is 1 otherwise. The denominator in (5.5) 
is easily explained. For any word u of length L, 

^ n i.i \ a J 1 + (n - l)x L > 1, 

(5.7) 22p x (t(u).a)= { ' 

l nx L = 1 ' 

because for all but one letter in E, the size of the last block in t(u).a is going 
to be 1. The only exception to this argument is, L = 1, when i(u) is empty. 
From (3.4), we get 

(5 - 8) = nx(l + (n - 1)^)^1 ' 

The numerator is x 7 ^, since we pick up a factor of x every time a new 
block starts. One factor x is cancelled because j3 x {u\) = x. □ 

The formula for the density is essentially an argument about the symme- 
try of the de Bruijn graph G n,L . 

Corollary 15. The probability in the stationary state of G n,L that site i is 
occupied by letter a is uniform, i.e., for any i s.th. 1 < i < L we have 

(5.9) = - (a E £). 

n 

Proof. Indeed, by Lemma [Tl] the stationary distribution p, x is invariant un- 
der any permutation of the letters of the alphabet S. Hence {n a '' 1 ) does not 
depend on a € S and we have uniformity. □ 

Since the de Bruijn-Bernoulli process has a product measure, the density 
of a at site i is also independent of i, but the density is not uniform since 
it is given by p a (5.1). The behavior of higher correlation functions here is 
more complicated than the de Bruijn-Bernoulli process. There is, however, 
one aspect in which it resembles the former, namely: 

Lemma 16. Correlation functions of G n,L in this model are independent of 
the length L of the words and they are shift-invariant. 
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Proof. We can represent an arbitrary correlation function in the de Bruijn 
graph G n ' L as 

(5.10) ( v ai ' h ■■■ v a ^) L = flx(w m a lW {1) ...w^a k w^), 

uj(°) ,...,w( k) 

where we have sites 1 < i\ < 12 < ■ ■ ■ < i k < L and letters a\, 02 5 • • • > &k € S, 
and where the sum runs over all (w^°\ . . . , w^ k ') with G S 4s+1_ * s_1 
for s G {0, . . . , k}, and where we put iq = and = L + 1. Now note 
that we have from Proposition [l] for any u G S fc 

(5.11) p>x(w-u) = Ux{u). 



Since fi x , as given in Lemma 14, is also invariant under reversal of words, 
we also have 

(5.12) p>x(u.w) = fjx(u). 



As a consequence, we can forget about the outermost summations in (5.13) 
and get 

(5.13) {r] ai > h ■■■r) ak > ik ) L = 

Y, Maiw {1) . . . w^a k ) = ( V a ^ ■ ■ ■ v ak ' Jk )i k ^ 1+ u 

wW,...,w( k - 1) 

where j s = i s — i\ + 1 (1 < s < k). Shift-invariance in the sense that 

(5.14) {r] ai ' h ■ ■ ■ rj ak ' ik ) L = (r/ ai ' il+1 • • • T} ah > ih+1 ) L 

is an immediate consequence. □ 

We now proceed to compute the two-point correlation function. This is 
an easy exercise in generating functions for words according to the number 
of blocks. The technique is known as "transfer-matrix method", see, e.g., 
Section 4.7 in |Sta97| . 

For a, b G £ and k > 1 we define the generating polynomial in the variable 

x 



(5.15) a ntk (a,b;x) = ^ 



x 



7(10)—! 



ea.S fe - 1 .b 



where, as before, 7(10) denotes the number of blocks in the block factoriza- 
tion of w G H + (so that "f{w) — 1 is the number of pairs of adjacent distinct 
letters in w). Note that 



(5.16) a n> i(a,b;x) ■ 

The following statement is folklore: 



1 if a = b, 
x if a 7^ b. 
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Lemma 17. Let I n denote the identity matrix and J n denote the all-one 
matrix, both of size n x n, and let K n (s, t) := s ■ I n + t ■ J n for parameters 
s,t. Then 

(5.17) K n (s,t)- l = - - K n (s + nt,-t). 

s{s + nt) 

Indeed, this is a very special case of what is known as the Sherman- 
Morrison formula, see |SM50| . [Wil59]. 
Consider now the matrix 

(5.18) A n (x) := [a n ^(a,b]x)] ab& = (1 - x) ■ l n + x ■ JJ n = K n {l - x,x) 

which encodes transition in the alphabet S. Then, for k > 1, A n (x) k is an 
(n x n)-matrix which in position (a, b) contains the generating polynomial 
a n ,k(a, b;x): 



(5.19) A n (x) k = [a ntk (a,b;x) 



We can get generating functions by summing the geometric series and using 
Lemma [TTJ 

Y,Mxfz k = (I n - Z ■ A^X))- 1 
k>0 

(5.20) = K n (l - z + xz, -xz)' 1 

K n (l — z — (n — l)xz, xz) 



(1 — z + xz)(l — z — (n — l)xz) 
which means that that for any two distinct letters a, b 6 X: 



y^«n,fc( 



a, a: x) z k 



1 — z — (n — 2)xz 



k>0 



(1 — z + xz)(l — z — (n — l)xz) 

1 1 n- 1 1 

+ 



(5.21) 



y^a n ,fc(a, b;x)z k 



k>l 



n 1 — z — (n — 1) xz n 1 — z + xz' 
zt 

(1 — z + xz)(l — z — (n — l)xz) 

11 11 

n 1 — z — (n — 1) xz n 1 — z + xz'' 

or equivalently, 

a n , fe (o, a; x) = - ((1 - (n - l)x) k + (n - 1)(1 - x) k ) , 
(5.22) J V 7 

a n>k {a, b; x) = - ((1 - (n - l)x) k - (1 - x) fe ) . 

We thus arrive at expressions for the two-point correlation functions: 
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Proposition 18. For a, b G £ with a ^ b and 1 < i < j < L, 

If 1 - <- 



1 re 



(5.23) 



71 



3' 



71" 



1 + (re 
1 - x 



l)x 

r 



1 + (re - l)x 



Proof. By Lemma 16 
14 



we may assume i = 1 and j = L. Comparing Lemma 



(5.24) 



with the definition of the a Ut k(a, b; x) in (5.15) we see that for a, b 6 S: 

an,L-l(fl,^) 



re(l + (re 

so that the assertion follows from 15.221 



lk 



□ 



The formula (5.23) is quite interesting because the first term, 1/re , has 



a significance. From the formula for the density in Corollary 15, we get 

L-l 

(5 25) ' ■ * 1 ■ : />■■"•■'■ \ / ' •" 



n 



1 



n- 



1 



lk 



1 + (re 

The object on the left hand side is called the truncated two point correlation 
function in the physics literature, and its value is an indication of how far the 
stationary distribution is from a product measure. In the case of a product 
measure, the right hand side would be zero. Setting 

(5.26) : ~ 



a 



1 



re 



IV 



we see that \a\ < 1, and so the truncated correlation function goes expo- 
nentially to zero as L — > oo. Thus, the stationary measure fi x behaves like 
a product measure if we do not look for observables which are close to each 



other. We can use (5.25) to understand one of the differences between the 



values x < 1 and x > 1, namely in the way this quantity converges. In the 
former case, the convergence is monotonic, and in the latter, oscillatory. 
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